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Abstract 

In this paper, we propose cylindrical distributions obtained by combining the sine-skewed 
von Mises distribution (circular part) with the Weibull distribution (linear part). This new 
model, the WeiSSVM, enjoys numerous advantages: simple normalizing constant and hence 
very tractable density, parameter-parsimony and interpretability, good circular-linear de¬ 
pendence structure, easy random number generation thanks to known marginal/conditional 
distributions, flexibility illustrated via excellent fitting abilities, and a straightforward exten¬ 
sion to the case of directional-linear data. Inferential issues, such as independence testing, 
circular-linear respectively linear-circular regression, can easily be tackled with our model, 
which we apply on two real data sets. We conclude the paper by discussing future applica¬ 
tions of our model. 

Key words: Circular-linear data, circular-linear regression, distributions on the cylinder, 
sine-skewed von Mises distribution, Weibull distribution 

1 Introduction 

Cylindrical data are observations that consist of a directional part (a set of angles), which is 
often of a circular nature (a single angle), and a linear part (mostly a positive real number). 
This explains the alternative terminology of directional-linear or circular-linear data. Such 
data occur frequently in natural sciences; typical examples are wind direction and another 
climatological variable such as wind speed or air temperature, the direction an animal moves 
and the distance moved, or wave direction and wave height. Recent studies of cylindrical data 
include the exploration of wind direction and S02 concentration (|6j), the analysis of Japanese 
earthquakes ([20]), the link between wildfire orientation and burnt area ( 0 ), and space-time 
modeling of sea currents in the Adriatic Sea am, m- 

A non-trivial yet fundamental problem is the joint modeling of the directional/circular and 
linear variables via the construction of cylindrical probability distributions. The best known 
examples stem from Mardia and Sutton (1978) [15], conditioning from a trivariate normal dis¬ 
tribution, and Johnson and Wehrly (1978) [9j, invoking maximum entropy principles. The latter 
also provide in their paper a general way, based on copulas, to construct circular-linear distri¬ 
butions with specified marginals. 

What desirable properties should a “good” cylindrical distribution possess? It should be 
able to model diverse shapes, in other words present good fitting aptitudes, yet it should ideally 
remain of a tractable form (this is crucial for stochastic properties, estimation purposes, and 
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circular-linear regression) and be parsimonious in terms of parameters at play. The marginal and 
conditional distributions should optimally be well-known and flexible (e.g., there is no reason 
for the circular component to be always symmetric), whilst the dependence structure has to 
take care of a reasonable joint behavior. Indeed, numerous examples of cylindrical data require 
that the circular concentration tends to increase with the linear component, as identified in the 
seminal paper [3]. 

All these conditions are well fulfilled by the new model we propose in the present paper. Its 
probability density function (pdf) is of the form 

(9, x ) (1 + Asin(# — fi)) x a ~ l exp [— (j3x) a (1 — tanh(re) cos(# — //)) 1, (1) 

27rcosh(re) 

where (x,0) € [0, oo) x [—7r, 7 t), a,/3 > 0, —n < fi < tt, re > 0 and — 1 < A < 1. The 
roles of the distinct parameters will be explained in Section [2j as well as the construction 
underpinning (JT]). Stochastic properties such as marginal and conditional distributions, random 
number generation, moment and correlation calculations are presented in Section [3j We will in 
particular stress the capacity of our new density to model cylindrical data with length-increasing 
circular concentration. Maximum likelihood estimation and the ensuing efficient likelihood ratio 
tests (including tests for circular-linear independence) are discussed in Section [4j as well as 
circular-linear and linear-circular regression. The excellent modeling capacities of our new model 
are illustrated by means of two real data sets in Section [5J We conclude the paper by some final 
comments in Section [6j including the straightforward extension of (HD to the higher-dimensional 
directional-linear setting. 

2 A new model for circular-linear data: the WeiSSVM 

Johnson and Wehrly proposed in [9j a very simple distribution able to fit cylindrical data where 
the circular concentration increases with the length of the linear part. Their density reads 

(9, x) i->- - ^ exp \—/3x (1 — tanh(re) cos (9 — //))], (2) 

2tt cosh(re) 

with —7T < /r < 7T, /3 > 0 and re > 0. The linear conditional density is the (negative) exponential, 
while the circular conditional density given X = x is of the form 

9 i—^——— - , , .. exp f/3xtanh(re) cos(fl — u,)] (3) 

2nI 0 (x/3 tanh(re)) F v v P ' J \ j 

where Jo (re) is the modified Bessel function of the first kind and order zero. The mapping ED is 
the popular von Mises density with location ^ and concentration j3x tanh(K), often considered 
as the circular analogue of the normal distribution. We attract the reader’s attention to the 
fact that we have slightly reparameterized the original Johnson-Wehrly parameterization which 
would correspond to using f3 and k\ = /3tanh(«;) instead of f3 and re, and hence adding the 
condition that < (3 in view of /3/ cosh(re) = (/3 2 — re 2 ) 1 / 2 . With our parameterization we avoid 
this condition, which is an advantage for numerical maximization methods. 

A drawback of the Johnson-Wehrly model © is its lack of flexibility. Both its conditional and 
marginal circular densities are symmetric (see Section [3.21 for details), when re = 0 the circular 
contribution in (ED boils down to the uniform law on [—7r,7r), and the circular concentration 
varies linearly with x (see ED). In order to overcome these limitations, we have applied two 
separate transformations to the Johnson-Wehrly density: a power transformation x >—?> x l ^ a 
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for a > 0 to the linear part, and a perturbation of the circular part via multiplication with 
6 i—(1 + Asin(# — y)) for A € [—1,1]. The former is the classical way to turn an exponential 
distribution on M + to the Weibull distribution with pdf x i—?> afix a_1 exp [— fix a ], which is a 
very popular distribution to model diverse natural phenomena. The effect of the perturbation 
is known in circular statistics as “sine-skewing” a reflectively symmetric distribution, see [2], 
Whenever A / 0, the resulting density becomes skewed, whereas symmetry is retrieved for 
A = 0; moreover, the perturbation leaves the normalizing constant untouched. The combined 
effect of both transformations (plus the change from fi to fi a mainly for aesthetic reasons) thus 
yields the pdf 

- a ^ (1 + A sin(# — y)) x 8 " 1 exp [—(fix) 01 (1 — tanh(/c) cos (9 — y))] , 

27r cosIi(k) 

which we term WeiSSVM for the interplay between the linear Weibull part and the circular 
sine-skewed von Mises part. 2D contour plots of the density (JTJ) are given in Figure [Q and show 
the versatility of our new model. 

Parameter interpretation becomes now clear: y and A respectively endorse the role of circular 
location and skewness parameters, while fi and a are linear scale and shape parameters. The 
parameter k plays, as in the original Johnson-Wehrly model, the role of circular concentration 
and circular-linear dependence parameter. Independence is attained when n = 0, in which case 
the density dTJ) becomes the product of the linear Weibull and the circular cardioid distribution 
with location y + 7r/2 and concentration A, see the first row of Figure [1] 

3 Properties of the WeiSSVM 

3.1 The normalizing constant 

As can be seen from (fTj) . the normalizing constant is very simple, which is a strong asset of our 
proposal. Indeed, it is not rare to encounter normalizing constants defined in terms of infinite 
series of functions, as for the Kato-Shimizu model DU. see also (|5|). 

3.2 Marginal and conditional distributions 

The marginal density of the circular component 0 from pdf (H|) is given by 

f(9) = -—— (1 + Asin(0 — y)) / afi a x a ~ 1 exp [— (fix) 01 (1 — tanh(rc) cos(0 — y))] dx 

27tcos1i(k) J o 

1 1 + A sin(0 — /r) 

27 t cosh(«:) 1 — tanh(Tc) cos(# — fi) 

1 — tanh 2 (7c/2) 1 + Asin (6 —/jl) 

27r 1 + tanh 2 (7c/2) — 2tanh(«:/2) cos (9 — y) ’ 

which is the sine-skewed wrapped Cauchy distribution ([2]), a flexible extension of the symmetric 
wrapped Cauchy distribution. 
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(a) (k,A) = (0,0) 


(b) (k,A) = (0,0.5) 
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(g) (K, A) = (1.5,0) 



Length 

(h) (k, A) = (1.5,0.5) 




(c) (k, A) = (0,1) 



(f) (k,A) = (1,1) 



(i) (k, A) = (1.5,1) 



Figure 1: Contour plots of the WeiSSVM density flU over [0,5) x [— n, tv) for (a,/3,/r) = (2,1,0) 
with (a) (k, A) = (0,0) (Weibull and uniform) (b) (k, A) = (0,0.5) (Weibull and cardioid), (c) 
(k, A) = (0,1) (Weibull and cardioid), (d) (k, A) = (1,0), (e) (/c, A) = (1,0.5), (f) (k, A) = (1,1), 
(g) («, A) = (1.5,0), (h) (k, A) = (1.5,0.5) and (i) (k, A) = (1.5,1). 
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The marginal density of the linear component X from pdf Jl]) in turn corresponds to 


/(*) = 


27r cos 
1 


a/3 a x a 1 f (1 + Asin(0 — n)) exp [— ((3x) a (1 — tanh(/c) cos(9 — /r))] dd 
)sn(«) J —7r 

/ 7T 

exp [(/3x) a tanh(re) cos (6 — //)] d6 

-7T 

a/^x^expH^xf], 


27T cosIi(k) 


Io(x a P a tanh(ft)) _ a 


cosIi(k) 


This is an extended version of the marginal density of ((21) given in [9]; as already noticed, it 
simplifies to the Weibull when k = 0. 

The conditional densities from (pQ) are now readily given by 


/(%) 


1 

2irIo(x a /3 a tanh(ft)) 


(1 + A sin(# — //)) exp [(/3x) a tanh(ft) cos(0 — fi)\ 


(4) 


and 


f(x\d) = a /3(1 — tanh(«) cos(0 — /r)) 1,/c 


x a 1 exp 


— ^/3 (1 — tanh(«;) cos (6 — x'j 


(5) 

Both densities are quite common; ([2]) is the sine-skewed von Mises distribution with concen¬ 
tration (/ 3x) a tanh(K) (note how the concentration now varies with x a instead of simply x) 
whereas (O is the Weibull with shape parameter /3 (1 — tanh(«;) cos(0 — . In general the 

conditional density dU) is unimodal, but it can become bimodal when the absolute value of the 
skewness parameter A is large (from our experiment, |A| > 0.918) because of the structure of 
the sine-skewed von Mises distribution, see [2]. 


3.3 Random number generation 

Thanks to the results of the previous section, we can describe a simple random number generation 
algorithm by decomposing f(0,x ) into f(x\6)f(9), in other words, by first generating 0 rsj no) 
and then X|0 = 0 ~ f{x\6). The algorithm goes as follows. 

Step 1: Generate a random variable 0i following a (symmetric) wrapped Cauchy law with location 
H and concentration tanh(«/2), and generate independently U ~ Unif[ 0,1]. 

Step 2: Define 0 as 

f 0i if U < (1 + Asin(0i — fi))/2 
\ —0i if U > (1 + Asin(@i — yu,))/2; 

0 then follows the sine-skewed wrapped Cauchy distribution. 

Step 3: Generate X from a Weibull with shape parameter f3 (1 — tanh(«) cos(0 — . 

Random number generation from sine-skewed distributions follows from general skew-symmetric 
theory on ; see [T9j. 
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3.4 Moment expressions 

The moments of the Weibull distribution and trigonometric moments of the sine-skewed von 
Mises distribution are given explicitly. These nice properties are inherited to our model. 

For n = 1,2,... and m = 1,2,..., we have 


E[AT n cos(m0)] 

- f [ x n cos(m0) (1 + Asin(0)) x a ~ l exp [— (J3x) a (1 — tanh(re) cos(0))l dOdx 

2vr cosh(K) J o J_ n 

-—— / cos (mO) / a/3 a x n x a ~ 1 exp [—(/3x) a (1 — tanh(/c) cos(0))l dxdd 

2vr cosh(K) JJ Q 


2ir cos’ 


i r 

—— / cos (m0) 
)sh(K) J_ v 


T(1 + n/a) 


(3 n (1 — tanh(/t) cos(0)) n// " +1 


d6 


T(n/a + l)(cosh(K)) n / a+1 f n 1 


r 1 

/ —cos (mO)- 

J — 7T 


d6 


cosh (K)/5 n J_ n 2n ( cosh ( K ) 

— sinh(/c) cos(0)) n/,Q+1 
T{n/a + l)(cosh(K)) n / Q r ("A* + 1 - m ) P n/ a ( cosh(«)) 

/ 3 n T(n/a + 1 ) 

where P™(z) is the associated Legendre function of the first kind of degree v and order m given 
by (equation 8.711.2 of [7], p. 969) 


P?(z) = 


-Z0 


/*71 

Jo 


cos mt 


-dt = 


i> + i) 


f 

J — 7T 


cos mt 


71 Jo (z + Vz 2 — 1 cos t) u+1 ttT(u — m + 1) J_ n (z — Vz 2 — 1 cos t) u+1 
Here, we used the relation 

= ELlzL = ( _i r r (" + 1 ) 


dt. 


T(-u) 


T(i/ — m + 1) 


Similarly, 

X n i 
a/3 


E[X n sin(m0)] 

Ot poo /*7T 


27TCOsh(/c) 

A r 

27TCOsh(/c) J_ 
A 


poo pix 

/ / x n sin(m0) (1 + Asin(0)) x a_1 exp [— (/3x) a (1 — tanh(«) cos(0))] dOdx 

Jo J-ir 

r 

(md) sin(0) 

Jo 

/7T 

sin(m0) sin(0) 

-7T 


af3 a x n x a 1 exp [— ((3x) a (1 — tanh(/t) cos(0))] dxdO 
T(n/a + 1) 


/3 n (1 — tanh(fc) cos(0)) n /“ +1 


d9 


27TCOsh(ft) J_ 

AT(n/a + l)(cosh(K))^ n//a+1 ^ f n 1 (cos((m — 1)0) — cos((m + 1)0)) 


cosh(K)/3 r 


—7r 2vr 2 (cosh(«:) — sinh(«;) cos(0)) n ^ a+1 


d6 


Ar(n/ a+ l)(cosh( K )r/°l / r(n/ a + 2- m) 1(coshW) _ rfr/a - m) p „ +1(coshW) 


P ri 


T(n/a + 1) n J 


T(n/a + 1) n /“ 


A(cosh(/y)) n /“ ( r ( n /« + 2 - m ) P ^/a 1 ( cosh ( K )) “ r ( n / a “ m )- P ,Va 1 ( cosh ( K ))) 


(3* 
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Specifying choices for m and n, and noting that the marginal of the circular part is the 
sine-skewed wrapped Cauchy density, we obtain the following simple moment expressions (we 
write P u {z) for P^(z)) 


E[X] 

E[X 2 ] 


(cosh(/t)) 1/,c T + l) 

P 

(cosh(/t)) 2 /“r + l) 

J 2 


-f > l/a(cOsh(K)), 
-P 2 /a( cosh(«)), 


E[cos(0)] = tanh ( —) , E[cos 2 (0)] = - (1 + tanh 2 ( — )) , 


E[sin(0)] 
E[X cos(0)] 

E[X sin(0)] 
E[cos(0) sin(0)] 


A 


2 cosh 2 (|) 


, E[sin 2 (0)] = 


2 cosh 2 (!) ’ 


(cosh( K )) 1 /«r (i) j w ^ 
- -p -E 1/Q (cosh(K)), 


A(cosh( K )) 1 /« (r (£ + 1) Ei/a(cosh(/0) - r (E - 1) P 1 2 /a (cosh( K ))) 


p 

A tanh (!) 
4cosh 2 (f) 


3.5 Circular-linear correlation 

From the moment expressions of the previous section we readily derive the following quantities: 

v2 „ , W 


Var[X] 

Var[cos(0)] 

Var[sin(0)] 

Cov(Af, cos(0)) 

Cov(X, sin(0)) 
Cov(cos(0), sin(0)) 


(cosh(«:)) 2 / a (r + 1) P 2 /a(cosh(K)) - T (^ + l) P 1/a ( cosh(«:)) s 


P 2 


1 


2 cosh 2 (!) 

1 / 


2cosh 2 (!) y 2cosh 2 (!) ’’ 

(cosh (k)) 1 /* (t (E) P-l /a (cosh(K)) - r (i + 1) Pi/a (cosh («)) tanh (§)) 

P ’ 

A(cosh(«:)) 1 /“ (t (i + 1) tanh 2 (f) P 1/a ( cosh(/«)) - T (± - l) P 2 /q (cos1i(k))) 


A tanh (!) 


2/3 


4cosh 2 (!) ' 
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(a) A = 0 (b) A = 0.5 




(c) A = 1 



or 


Figure 2: Contour plots of the circular-linear correlation R. x0 as a function of (a, k) over [0, 7] x 
[0, 5] for (a) A = 0, (b) A = 0.5, (c) A = 1. 


Using these expressions, the correlations are given by 
r xc = Corr(X,cos(0)) 

\/2 cosh (§) (t (£) P 1 1 /a (cosh( K )) - T (£ + l) P 1/a (cosh(/s)) tanh (§)) 

\/ r (a + !) - P 2 /a(cOSh(K)) “ T (^ + l)“ P 1/a (cOsh(K)) 2 
= Corr(W,sin(0)) 

A cosh (§) (T (i + 1) tanh 2 (§) Pi /a (cosh(/s)) — T (^ — l) P 2 /Q (cosh(K))) 

V^\/ r (I + x ) - P 2 / a (cosh(/c)) - T (1 + l) 2 Pi /a (cosh(K)) 2 y^ 2co jf 2 («) 

r cs = Corr(cos(0), sin(0)) 

A tanh (|) 

The circular-linear correlation, which was proposed by HU and |S], can then be obtained via 
the formula 

D 2 _ uL + uL - 2r cs r xc r xs 

U x9 I _ r 2 

' cs 

We see from the above expressions that the circular-linear correlation neither depends on 
the parameter /3 nor on the sign of A. In particular, when A = 0, it simplifies to R xd = r 2 c . The 
influence of a, k and |A| are shown via contour plots in Figure [2j We observe that, for fixed k 
and A, the correlation increases with a. The influence of k goes as follows: at fixed a and A, 
the correlation increases for small values of k until it reaches its maximum, and then decreases. 
This phenomenon is directly inherited from the Johnson-Wehrly construction, as we show in 
Figure [3l 
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Figure 3: Circular-linear correlation for the Johnson-Wehrly model (a = 1 and A = 0) as a 
function of k. 


3.6 A generalization of the WeiSSVM 

A natural generalization of our WeiSSVM model consists in replacing the linear Weibull part 
with the Generalized Gamma distribution m, resulting in the Generalized Gamma sine-skewed 
von Mises (GGSSVM) density 


(6, x) i->- C (1 + A sin(0 — fi)) x a 1 exp [—(/Lc ) 7 (1 — tanh(/c) cos(0 — /r))] , ( 6 ) 


with ct, 7 , /3 > 0, n > 0, —1 < A < 1 and — it < \i < it. The normalizing constant is calculated 
as follows: 



(1 + Asin(# — n))x a 1 exp[—(/3x) 7 (l — tanhfA) cos (0 — ^)))dxd9 



The WeiSSVM clearly corresponds to 7 = a in (0 . All properties of the GGSSVM are obtained 
along the same lines as our developments in the previous sections, albeit with more involved 
calculations. It is to be noted that the circular marginal distribution for the GGSSVM is the 
sine-skewed Jones-Pewsey distribution (see Gama). 

We prefer the WeiSSVM over the GGSSVM because of its simplicity, parameter parsimony, 
higher tractability and its neat link with the Johnson-Wehrly distribution. This explains why 
the main focus of the present paper lies on the WeiSSVM and why we only briefly mention the 
density ([ 6 ]). 

4 Statistical inference 

4.1 Parameter estimation 

Let (8\,xi),... ,(9 n ,x n ) be a sample of n independent and identically distributed couples of 
angular and linear observations drawn from the distribution with density ([T]). Then the log- 
likelihood function can be expressed as 
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( 7 ) 


n n 

£(a, P, fx, k, A) = (a — 1) E log Xi - /3 a E xf (1 — tanh (k) cos ( 0 * — fi)) 

i =1 i=l 

n 

+ E log(l + Asin(#j — /i)) + n(a log /3 + logo; — log(27r cosIi(k))). 
i =1 

The elements of the score vector are just the first-order partial derivatives of (0 with respect 
to each of the parameters: 


d£_ 

da 

d£_ 

dt_ 

dfx 

dl_ 

dn 

m 

d\ 


n n / f\ 

E log Xi - j3 a E log (@Xi)xf (1 - tanh (k) cos (9i — ft)) + n ( log/3 -|— ) 

i =l i=l ^ a ' 


-a/3 a 1 ~ tanh(/t) cos(fli — //)) + ^ 


i =1 


P 

cos (6i — n) 


r tanli(fv) ^ x? 5 i„(«, - «) * A ^ r _ , 

i=l i=l v 

j^a n ^ 

7— w \ \ o E cos ( 6 '* - ^)~ n tanh( K ), 

(cosh(tt)r 

x \ // z —i 

■A sin(6 > i - //) 

“ 1 + Asin(0i - /i)' 

2=1 v 7 


It is difficult to give closed-form expressions for the maximum likelihood estimates (MLEs), 
hence numerical methods should be used to find the solutions. We used the function NMaxi- 
mize in Mathematica where the default numerical maximization algorithm is Nelder-Mead , and 
encountered no problems in the optimization procedure. 


4.2 Submodel and independence testing 

Testing for submodels of the WeiSSVM model is straightforward via likelihood ratio tests. For 
each parameter 7] £ {a, P, //, k, A}, we denote fj the unconstrained maximum likelihood estimate 
and fjo the maximum likelihood estimate under the respective null hypotheses. Two particular 
instances are of interest. On the one hand, testing for the Johnson-Wehrly submodel, which is 
taken care of by the test statistic 

T JW = -2{\og£(l,Po,p 0 ,k 0 ,0) - log£(a,P,p,k,X)), 

rejecting T-Lq : (a = 1)0 (A = 0 ) at asymptotic level t] whenever Tj-yy exceeds x | ;1 , the ??-upper 

quantile of the chi-square distribution with 2 degrees of freedom. On the other hand, we are 
interested in testing for circular-linear independence via the test statistic 

Tfndep = — 2 (logt'(d 0 , Po, po, 0 , A 0 ) - log £[a, P, p, k, A)), 

to be compared with Such tests have a long-standing history in the statistical litera¬ 

ture; see [S] for a recent proposal, based on directional-linear kernel density estimation, and for 
references. 
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4.3 Circular-linear and linear-circular regression 


Since the conditional distributions take a very simple form, the WeiSSVM model lends itself for 
circular-linear as well as linear-circular regression, similarly as in [9]. The mean and variance of 
X given 0 = 0 correspond to 


Epf|© = e\ = 


/3 (1 — tanh(fif) cos(0 — ^)) 1 ^ a V® 


r - + i 


and 


Var[V|0 = 0] = 


1 


+ i -r 


i 


a 


+ 1 


/3 2 (1 — tanh(ft) cos(0 — /r)) 2 /“ \ V a 
respectively. The first mean direction and mean resultant length of 0 given X = x are provided 
by 

P i\x=x = ar S ((fa) a tanh(«) + iX ), 
with arg denoting the argument of a complex number, and 


Pl\X=x 


Ii((/3x) a tanh(«;)) 


(/ 3x) a tanh(K)/o((/3x)“ tanh(«)) 


\J (/ 3x) 2a tanh 2 («) + A 2 , 


respectively. The parameters in each regression model are readily estimated via maximum 
likelihood, see Section HU 


5 Fitting two circular-linear real data sets 


In this section we shall illustrate the good fitting behavior of the WeiSSVM by analyzing two 
popular data sets from the literature. While the first data set reflects exactly the characteris¬ 
tics of data the WeiSSVM is tailor-made for, namely concentration increasing with length and 
circular skewness, these attributes are much less marked in the second data set. This allows 
for a meaningful assessment of the modeling capacities of our new model. In each case, we will 
compare the WeiSSVM with the Johnson-Wehrly distribution, the independence model, and the 
GGSSVM of Section 13.61 Our means of comparison shall be the Akaike Information Criterion 
(AIC) and Bayesian Information Criterion (BIC), and we will apply the tests of Section [4.21 
Besides these alternative distributions, we shall as well indicate a comparison with the 
Mardia-Sutton model [IS] and its recent extension proposed in El, the Kato-Shimizu distribu¬ 
tion. The latter has as density 


fKs(0,x ) = Cexp 


(x-m ) 2 

2ct 2 


+ Kl COs(0 - Hi) + K 2 cos(2(0 - Hz)) , 


( 8 ) 


where —tt < 6 < ir, — oo < x < oo, er > 0, Ki,K 2 > 0, — ir < /Ui < ir, —7^/2 < ^2 < tt/2, 
fi{6) = n' + Acos(0 — v), —00 < fi' < 00 , A > 0, — tt < v < tt and its normalizing constant C is 
provided by 


Ij' (^2) I‘2j (v 1) cos( 2 j ( f_l 1 - H2)) J ■ 

The Mardia-Sutton model is obtained by setting K 2 = 0 in (jSJ). The infinite sum in the normal¬ 
izing constant then vanishes, resulting in a simpler density. In the following comparisons, we 
draw the reader’s attention to the fact that both the Mardia-Sutton and Kato-Shimizu models 
are defined over M x C\ with C\ the unit circle in R 2 , whereas the above-mentioned densities are 
defined over R + x C\. 


( 00 

4o(«i)/oM + 2j] 
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Table 1: Maximum likelihood estimates, maximized log-likelihood (MLL), Akaike Information 
Criterion (AIC) and Bayesian Information Criterion (BIC) values for the Weibull sine-skewed 
von Mises (WeiSSVM) and its competitor models, the Generalized Gamma sine-skewed von 
Mises (GGSSVM), the Johnson-Wehrly (JW), the independence (Indep.), Mardia-Sutton (MS) 
and Kato-Shimizu (KS) models, fitted to the blue periwinkle data. 


Distributions 

a 

A 

7 

A 

k 

A 



MLL 

AIC 

BIC 

WeiSSVM 

2.01 

0.05 


-1.90 

1.68 

1.00 



-168.57 

347.13 

354.30 

GGSSVM 

2.00 

0.05 

2.04 

-1.90 

1.70 

1.00 



-168.56 

349.13 

357.73 

JW 


0.10 


-1.70 

1.44 




-182.93 

371.86 

376.16 

Indep. 

1.54 

0.02 


-2.97 


1.00 



-187.25 

382.50 

388.24 


A 

G 

A 

V 

At 

A 2 

ki 

£2 

MLL 

AIC 

BIC 

MS 

28.58 

24.43 

29.63 

-2.11 

-1.52 


2.59 


-176.88 

365.75 

374.36 

KS 

28.58 

24.43 

29.63 

-2.11 

-0.97 

0.73 

8.16 

3.46 

-168.46 

352.93 

364.40 


5.1 Periwinkle data 

We give an analysis of n = 31 observations which consist of the movements of blue periwinkles 
after they had been transplanted downshore from the height at which they normally live. The 
data set was taken from Table 1 of [J; see that paper for details about the experience. 

A visual inspection of the data points in Figure^] reveals that the concentration of the circular 
part tends to increase with length, which is precisely one of the features that the WeiSSVM model 
can well incorporate. Moreover, m have shown that, on basis of the Pewsey test of symmetry 
(see m), the circular part of the data is asymmetric. 

Table Q] presents the maximum likelihood estimates, maximized log-likelihood, Akaike and 
Bayesian Information Criterion values obtained from all models under investigation. As we can 
see, the location parameters of the GGSSVM and its submodels are close (note that the location 
of the Independence model is —2.97 + n/2 = —1.40, as explained at the end of Section [2]) and 
the WeiSSVM has the lowest AIC and BIC values. It clearly improves on Johnson-Wehrly 
and Mardia-Sutton, and even on the flexible Kato-Shimizu model. It is quite remarkable to 
notice the tiny difference in the maximized log-likelihood between WeiSSVM and the embedding 
model, the GGSSVM. The likelihood ratio test for the Johnson-Wehrly submodel (w.r.t. the 
WeiSSVM) takes value Tjw = —2(—182.93 + 168.57) = 28.72, with p-value = 0.00, which 
emphatically rejects the Johnson-Wehrly model. Even stronger, the independence test yields 
Tindep = —2(—187.25 + 168.57) = 37.36, stressing the dependence between the angular and the 
linear part. 

As a conclusion, our WeiSSVM model (with 5 parameters) is a good-fitting and parsimonious 
model for the periwinkle data set. For visual impression, we have superimposed the contour plot 
of the fitted WeiSSVM model on a scatter plot of the data in the panel making up Figure [4j 

5.2 Wind direction and temperature data 

As second example, we consider the original data set from m, consisting of 28 measurements of 
wind direction and temperature at Kew during the period 1956-1960. The data are taken from 
Table 1 in [T5] and illustrated in Figure [5) Although the effect noticed for the periwinkle data, 
namely high concentration for high linear values, is less marked here, Mardia and Sutton have 
noted (and established) a strong dependence between the circular and the linear component. 
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Figure 4: Contour plot of the blue periwinkle data (in lengths and radians), together with the 
fitted WeiSSVM density. The data are plotted over [0,125) x [—7r, 7r), with the distance indicated 
in cm. 


It has been shown in HD that the Mardia-Sutton model is extremely good for this data set; 
it is therefore very interesting to compare it with the WeiSSVM. Tabic [2] contains the maximum 
likelihood estimates, maximized log-likelihood, Akaike and Bayesian information criterion values. 
The (circular) location parameters of our proposed models are almost the same (again, the 
location of the Independence model is —0.62 + 7t/2 = 0.95). We see that our WeiSSVM model 
best incorporates the non-trivial behavior of this data set (its AIC and BIC values are clearly 
below that of the MS model), and again it is much better than the Johnson-Wehrly model (which 
is clearly rejected as submodel). A contour plot of the fitted WeiSSVM model with a scatter 
plot of the data is provided in Figure [5j We finally note that the independence test of course 
heavily rejects (p -value = 0.00) the null of independence, hereby agreeing with [15] . 

6 Future research and a directional-linear extension 

Given its very good fitting capacities and simple parameter interpretation, the WeiSSVM is a 
viable model to investigate in detail further data sets. Two concrete examples shall be elucidated 
in the future. The first concerns ecological data related to trees. Indeed, [lj have only used the 
direction of fallen logs, hence a pure circular setting, to model the influence of neighborhood 
structure and directionality of radiation on crown asymmetry; a more detailed analysis can be 
obtained by adding as linear part the distance to each neighboring tree. The second data set 
concerns cylindrical data consisting of the burnt area and the direction of wildfires in Portugal, 
as analyzed in [3] and [5]. Our parametric model will be an interesting alternative especially to 
the non-parametric approach of the latter paper. 

The latter data are both circular-linear and directional-linear, hence requiring an extension of 
the WeiSSVM to the directional-linear setting. We shall now briefly indicate how straightforward 
this extension is. It is obtained by replacing the circular sine-skewed von Mises density with 
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Table 2: Maximum likelihood estimates, maximized log-likelihood (MLL), Akaike Information 
Criterion (AIC) and Bayesian Information Criterion (BIC) values for the Weibull sine-skewed 
von Mises (WeiSSVM) and its competitor models, the Generalized Gamma sine-skewed von 
Mises (GGSSVM), the Johnson-Wehrly (JW), the independence (Indep.), Mardia-Sutton (MS) 
and Kato-Shimizu (KS) models, fitted to the wind-temperature data. 


Distributions 

a 

P 

7 

A 

k 

A 



MLL 

AIC 

BIC 

WeiSSVM 

10.72 

0.02 


0.54 

1.10 

0.49 



-125.70 

261.39 

268.05 

GGSSVM 

10.78 

0.02 

10.64 

0.54 

1.09 

0.49 



-125.69 

263.39 

271.38 

JW 


0.03 


0.83 

0.60 




-180.77 

367.55 

371.54 

Indep. 

8.90 

0.02 


-0.62 


0.78 



-134.32 

276.64 

281.97 


A 

a 

A 

0 

At 

A 2 

ki 

£2 

MLL 

AIC 

BIC 

MS 

42.07 

4.86 

5.01 

0.36 

0.88 


1.14 


-128.10 

268.19 

276.19 

KS 

42.07 

4.86 

5.01 

0.36 

1.09 

0.48 

1.02 

0.53 

-126.66 

269.32 

279.98 


its equivalent on unit spheres S k 1 = {v € K k : ||v|| = 1}, k > 3, recently defined in m- 
The cosine part simply becomes the scalar product O'/j. between 6 G S k 1 and the location 
parameter fx G while A sin(0 — n) is expressed as y 7 1 — (6'ix) 2 X'S ll (0), A € where 

Sfj = (0 — (6'n)n)/\\6 — (0'fx)n\\ is the multivariate sign vector on the unit sphere. We refer to 
m for further information. 

The density of the Weibull sine-skewed Fisher-von Mises-LangevirQ, in short WeiSSFVML, 
distribution on x M + , for the directional part with respect to the usual surface area measure 
da k - 1 , is defined as 


(0, x ) f(0, x ) = C k (l + y/l- (0» 2 A'S M (0)) x a 1 exp [~(/3x) a ( 1 - tanh(K)0V)] • (9) 

The normalizing constant of the distribution 0 is simply given by 

a/3"(sinh(«:))i fc / 2 i 1 


C k = 


(27r) fc / 2 cosh(K)P^ 2 ^ 2 2 ^ (cosh(ft)) 


Indeed 


's k ~ x Jo 


+ y/l - (0» 2 A'S m ( 0)) /- 1 exp [—(/3x) Q ( 1 — tanh {k)Q' ix)\ dxda k _i(0) 

/• /*oo 

/ / x Q_1 exp [— (/3x) a (l — tanh(«;)0 / /x)l dxda k -i(0) 

J s k ~ 1 Jo 

r /*1 roo 

/ / / x Q_1 exp [— (f3x) a (l — tanh(K)t)] dxda k - 2 {v)(\ — t 2 / k ~ 3 ^ 2 dt 

J -1 J o 

/■l (! _ £2)(fc-3)/2 


27r fc / 2 

a/3 Q r(A:/2)P(l/2, (A; — l)/2) J_i 1 — tanh(/t)f 
(27r) fc / 2 cosh(K)P^T^ 2 2 ^ (cosh(/t)) 


fc/ 2-2 
ct/3 Q (sinh(/t))( fc / 2 ) 


-l 


1 In higher dimensions, the VM is called Fisher-von Mises-Langevin and hence abbreviated FVML. 
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Figure 5: Contour plot of the wind and temperature data (in lengths and radians), together 
with the fitted WeiSSVM density. The data are plotted over [30, 60) x [—7r, 7r), with temperature 
indicated in Fahrenheit. 


where £?(•,•) denotes the beta function. We have used above the change of variables formula 
dak-i(0) = (1 — t 2 )( fc - 3 )/ 2 dcrfc_ 2 (v)<if where v € S k ~ 1 (fJ,- L ) = {v € M k : ||v|| = l.v'/i = 0}, 
the equality uik-i = Wk/B( 1/2, ( k — l)/2) (with = 2ir k / 2 /T(k/2) the surface area measure of 
5 fc-1 ) as well as, like for the result of Section 5 in [10j . the following relationship of the associated 
Legendre function (equation 8.711.1 of [7], p. 969) 




f 1 a dt 

2> Jj y/irr(fA + |) U ( z + tVz 2 — l)^~ u 


[K/i > -1/2, 


arg(z ±1)| < 7r], 


Clearly, the distribution ([9]) reduces to (H|) when k = 2, and as in [10], the distribution also 
has a simpler form when k = 3, namely, 


f{0,x) = (l + V 1 ~ x a lex P [~(^) Q (1 - tanh(«;)^ , /i)] . 

Investigating stochastic properties as well as estimation procedures is beyond the scope of the 
present paper and left for future research. 
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